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We investigate the quantization of non-zero sum games. For the particular case of the Prisoners' 
Dilemma we show that this game ceases to pose a dilemma if quantum strategies are allowed for. 
We also construct a particular quantum strategy which always gives reward if played against any 
classical strategy. 
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One might wonder what games and physics could have 
possibly in common. After all, games like chess or poker 
seem to heavily rely on bluffing, guessing and other ac- 
tivities of unphysical character. Yet, as was shown by 
von Neumann and Morgenstern conscious choice is 
not essential for a theory of games. At the most ab- 
stract level, game theory is about numbers that entities 
are efficiently acting to maximize or minimize For a 
quantum physicist it is then legitimate to ask what hap- 
pens if linear superpositions of these actions are allowed 
for, that is if games are generalized into the quantum 
domain. 

There are several reasons why quantizing games may 
be interesting. First, classical game theory is a well 
established discipline of applied mathematics Q which 
has found numerous applications in economy, psychology, 
ecology and biology 1^,1). Since it is based on probabil- 
ity to a large extend, there is a fundamental interest in 
generalizing this theory to the domain of quantum prob- 
abilities. Second, if the "Selfish Genes" jsj are reality, we 
may speculate that games of survival are being played al- 
ready on the molecular level where quantum mechanics 
dictates the rules. Third, there is an intimate connection 
between the theory of games and the theory of quan- 
tum communication. Indeed, whenever a player passes 
his decision to the other player or the game's arbiter, 
he in fact communicates information, which ~ as we live 
in a quantum world - is legitimate to think of as quan- 
tum information. On the other hand it has recently been 
transpired that eavesdropping in quantum-channel com- 
munication 1^-^ and optimal cloning Q can readily be 
conceived a strategic game between two or more play- 
ers, the objective being to obtain as much information as 
possible in a given set-up. Finally, quantum mechanics 
may well be useful to win some specially designed zero- 
sum unfair games, like PQ penny flip, as was recently 
demonstrated by Meyer Q , and it may assure fairness in 
remote gambling Q. 

In this letter we consider non-zero sum games where 
- in contrast to zero-sum games - the two players no 
longer appear in strict opposition to each other, but 
may rather benefit from mutual cooperation. A par- 
ticular instance of this class of games, which has found 
widespread applications in many areas of science, is the 



Prisoners' Dilemma. In the Prisoners' Dilemma, each of 
the two players, Alice and Bob, must independently de- 
cide whether she or he chooses to defect (strategy D) 
or cooperate (strategy C). Depending on their decision 
taken, each player receives a certain pay-off - see Tab. 
0. The objective of each player is to maximize his or 
her individual pay-off. The catch of the dilemma is that 
D is the dominant strategy, that is, rational reasoning 
forces each player to defect, and thereby doing substan- 
tially worse than if they would both decide to cooperate 
pO| . In terms of game theory, mutual defection is also a 
Nash equilibrium ^ : in contemplating on the move DD 
in retrospect, each of the players comes to the conclusion 
that he or she could not have done better by unilaterally 
changing his or her own strategy [ pl| . 

In this paper we give a physical model of the Pris- 
oners' Dilemma, and we show that - in the context of 
this model - the players escape the dilemma if they both 
resort to quantum strategies. Moreover, we shall demon- 
strate that (i) there exists a particular pair of quantum 
strategies which always gives reward and is a Nash equi- 
librium and (ii) there exist a particular quantum strategy 
which always gives at least reward if played against any 
classical strategy. 

The physical model consists of (i) a source of two bits, 
one bit for each player, (ii) a set of physical instruments 
which enable the player to manipulate his or her own bit 
in a strategic manner, and (iii) a physical measurement 
device which determines the players' pay-off from the 
state of the two bits. All three ingredients, the source, the 
players' physical instruments, and the pay-off physical 
measurement device are assumed to be perfectly known 
to both players. 

TABLE I. Pay-off matrix for the Prisoners' Dilemma. The 
first entry in the parenthesis denotes the pay-off of Alice and 
the second number is Bob's pay-off. The numerical values 
are chosen as in Referring to Eq. (^) this choice corre- 
sponds to r = 3 {"reward"), p — 1 {"punishment"), t = 5 
{"temptation"), and s = {"sucker's pay-off"). 
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The quantum formulation proceeds by assigning the 
possible outcomes of the classical strategies D and C two 
basis vectors \D) and |C) in the Hilbert space of a two- 
state system, i.e., a qubit. At each instance, the state of 
the game is described by a vector in the tensor product 
space which is spanned by the classical game basis |CC), 
\CD), \DC) and \DD), where the first and second entry 
refer to Alice's and Bob's qubit, respectively. 

The board of our quantum-game is depicted in Fig. ||; 
it can in fact be considered a simple quantum network 
[ p^ with sources, reversible one-bit and two-bit gates and 
sinks. Note that the complexity is minimal in this im- 
plementation as the players' decisions are encoded in di- 
chotomic variables. 
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FIG. 1. The setup of a two player quantum game. 

We denote the game's initial state by Ii/jq) = J\CC) , 
where ^7 is a unitary operator which is known to both 
players. For fair games must be symmetric with re- 
spect to the interchange of the two players. The strate- 
gies are executed on the distributed pair of qubits in the 
state Strategic moves of Alice and Bob are asso- 

ciated with unitary operators Ua and Ub, respectively, 
which are chosen from a strategic space S. The inde- 
pendence of the players dictates that Ua and U b operate 
exclusively on the qubits in Alice's and Bob's posses- 
sion, respectively. The strategic space S may therefore 
be identified with some subset of the group of unitary 
2x2 matrices. 

Having executed their moves, which leaves the game 
in a state (C/4 (g) U b)J\CC) , Alice and Bob forward their 
qubits for the final measurement which determines their 
pay-off. The measurement device consists of a reversible 
two-bit gate J which is followed by a pair of Stern Ger- 
lach type detectors. The two channels of each detector 
are labeled by cr = C, D. With the proviso of subsequent 
justification we set J — j\ such that the final state 
IV'/) — \'4'f{UA,UB)) of the game prior to detection is 
given by 



\^f) = (Ua Ub) J\CC) . 



(1) 



The subsequent detection yields a particular result, 
aa' = CD say, and the pay-off is returned according 
to the corresponding entry of the pay-off matrix. Yet 
quantum mechanics being a fundamentally probabilistic 
theory, the only strategic notion of a pay-off is the ex- 
pected pay-off. Alice's expected pay-off is given by 



rPcc + pPoD + tPoc + sPcD 



(2) 



where P^a' = I P is the joint probabihty that the 

channels a and a' of the Stern- Gerlach type devices will 



click. Bob's expected pay-off is obtained by interchang- 
ing t <-!■ s in the last two entries (for numerical values of 
r, p, t, s see Tab. ^. Note that Alice's expected pay-off 
$A not only depends on her choice of strategy Ua, but 
also on Bob's choice Ub- 

It proves to be sufhcient to restrict the strategic space 
to the 2-parameter set of unitary 2x2 matrices 



U{9,^)^ 



e''^cos6'/2 sin 6*72 



-sin 6*72 



cose/2 



(3) 



with < 6* < TT and < </> < 7r/2. To be specific, we 
associate the strategy "cooperate" with the operator 



C'=C/(0,0), C 



1 
1 



(4) 



while the strategy "defect" is associated with a spin-flip, 

b^U{^,Q), b=(^\lY (5) 

In order to guarantee that the ordinary Prisoners' 
Dilemma is faithfully represented, we impose the sub- 
sidiary conditions 

[J ,b ® b] ^ Q ,[j ,b ® c] ^ Q ,[j ,c ® b] = . (6) 

These conditions together with the identification J — j'^ 
imply that for any pair of strategies taken from the sub- 
set 5o = {^7(6*, 0)1 Q e [0, tt]}, the joint probabilities P^a' 
factorize, P^a' = P'a'p'b ^ where p'*^^ ~ cos^(6'/2) and 
p'-^' = 1 — p'^'~^\ Identifying p'^'-^^ with the individual 
preference to cooperate, we observe that condition (|^) 
in fact assures that the quantum Prisoners' Dilemma en- 
tails a faithful representation of the most general classical 
Prisoners' Dilemma, where each player uses a biased coin 
in order to decide whether he or she chooses to cooper- 
ate or to defect jisj. Of course, the entire set of quantum 
strategies is much bigger than 5o, and it is the quantum 
sector S\Sq which offers additional degrees of freedom 
that can be exploited for strategic purposes. Note that 
our quantization scheme applies to any two player bi- 
nary choice symmetric game and - due to the classical 
correspondence principle Eq. - is to a great extent 
canonical. 

Factoring out Abelian subgroups which yield nothing 
but a reparametrization of the quantum sector of the 
strategic space S, a solution of Eq. (||) is given by 



J ^eyi^{i-lb®b/2}, 



(7) 



where 7 £ [0, 7r/2] is a real parameter. In fact, 7 is a mea- 
sure for the game's entanglement. For a separable game 
7 = 0, and the joint probabilities Po-o-' factorize for all 
possible pairs of strategies Ua, Ub- Fig. ^ shows Alice's 
expected pay-off for 7 = 0. As can be seen in this figure, 
for any of Bob's choices U b Alice's pay-off is maximized 



2 



if she chooses to play D. The game being symmetric, the 
same holds for Bob and D®D\s the equilibrium in dom- 
inant strategies. Indeed, separable games do not display 
any features which go beyond the classical game. 
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FIG. 2. Alice's pay-off in a separable game. In this and 
the following plot we have chosen a certain parametrization 
such that the strategies Oa and Ub each depend on a single 
parameter t £ [—1, 1] only: we set Ua = U{tTr,0) for t £ [0, 1] 
and Ua = U{0,-tn/2) for t £ [-1,0) (same for Bob). De- 
fection D corresponds to the value t = 1, cooperation C to 
t = 0, and Q is represented by t = — 1. 

The situation is entirely different for a maximally en- 
tangled game 7 = tt/2. Here, pairs of strategies exist 
which have no counterpart in the classical domain, yet 
by virtue of Eq. (|^) the game behaves completely clas- 
sical if both players decide to play = 0. For example. 
Pec = I cos((/)yi-}-0B) cos(^a/2) cos(0b/2)P factorizes on 
So So (i.e., <j)A — (t^B = fixed), but exhibits non-local 
correlations otherwise. In Fig. ^ we depict Alice's pay-off 
in the Prisoners' Dilemma as a function of the strategies 
Ua, Ub- Assuming Bob chooses D, Alice's best reply 
would be 




while assuming Bob plays C Alice's best strategy would 
be defection D. Thus, there is no dominant strategy left 
for Alice. The game being symmetric, the same holds for 
Bob, i.e., D®D is no longer an equilibrium in dominant 
strategies. 

Surprisingly, D (g) D even ceases to be a Nash equi- 
librium as both players can improve by unilaterally de- 
viating from the strategy D. However, concomitant 
with the disappearance of the equilibrium D ® D a, 
new Nash equilibrium Q ® Q has emerged with pay-off 
%a{Q,Q) = %b{Q,Q) = 3. Indeed, $^(^7(0, 0), Q) = 
cos2(6'/2) (3sin2(?!)-|-cos2 0) < 3 for aU 9 e [0, tt] and 

<f> e [0,7r/2] and analogously %b{Q,Ub) < $b{Q,Q) for 
all [/s e S' such that no player can gain from unilaterally 
deviating from Q ® Q. It can be shown that Q ® Q \s 
a unique equilibrium, that is, rational reasoning dictates 
both players to play Q as their optimal strategy. 

It is interesting to see that Q ® Q has the property 



to be Pareto optimal 1^, that is, by deviating from this 
pair of strategies it is not possible to increase the pay-olf 
of one player without lessening the pay-off of the other 
player. In the classical game only mutual cooperation 
is Pareto optimal, but it is not an equilibrium solution. 
One could say that by allowing for quantum strategies 
the players escape the dilemma p^ ]. 

The alert reader may object that - very much like any 
quantum mechanical system can be simulated on a clas- 
sical computer - the quantum game proposed here can 
be played by purely classical means. For instance, Alice 
and Bob each may communicate their choice of angles 
to the judge using ordinary telephone lines. The judge 
computes the values Pcrcr' , tosses a four-sided coin which 
is biased on these values, and returns the pay-off accord- 
ing to the outcome of the experiment. While such an 
implementation yields the proper pay-off in this scenario 
four real numbers have to be transmitted. This contrasts 
most dramatically with our quantum mechanical model 
which is more economical as far as communication re- 
sources are concerned. Moreover, any local hidden vari- 
able model of the physical scheme presented here predicts 
inequalities for Pa-a', as functions of the four angles 6a, 
9b, 4>a, and 4>b, which are violated by the above ex- 
pressions for the expected pay-off. We conclude that in 
an environment with limited resources, it is only quan- 
tum mechanics which allows for an implementation of the 
game presented here. 
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FIG. 3. Alice's pay-off for a maximally entangled game. 
The parametrization is chosen as in Fig. U . 

So far we have considered fair games where both play- 
ers had access to a common strategic space. What hap- 
pens when we introduce an unfair situation: Alice may 
use a quantum strategy, i.e., her strategic space is still 
5, while Bob is restricted to apply only "classical strate- 
gies" characterized by = 0? In this case Alice is well 
advised to play 

M = U{n/2, 7r/2), A> = ^ ( \ \ ) , (9) 

(the "miracle move"), giving her at least reward r — Z 
as pay-off, since %a{M ,tj[0,Q)) > 3 for any 9 S [0,7r], 
leaving Bob with $b(M, ?7(6', 0)) < 1/2 (see Fig. | (a)). 
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Hence if in an unfair game Alice can be sure that Bob 
plays U{9,0), she may choose "Always- M" as her pre- 
ferred strategy in an iterated game. This certainly out- 
performs tit-for-tat, but one must keep in mind that the 
assumed asymmetry is essential for this argument. 

It is moreover interesting to investigate how Alice's 
advantage in an unfair game depends on the degree of 
entanglement of the initial state |'0o)- The minimal ex- 
pected pay-off TO Alice can always attain by choosing an 
appropriate strategy Ua is given by 

m = max min $AiUA,UB)', (10) 
UAesuB=u{e,Q) 

Alice will not settle for anything less than this quantity. 
Considering to a function of the entanglement parame- 
ter 7 g [0, 7r/2] it is clear that to(0) = 1 (since in this 
case the dominant strategy D is the optimal choice) while 
for maximal entanglement we find m(Tr/2) = 3 which is 
achieved by playing M. Fig. ^ (b) shows to as function 
of the entanglement parameter 7. We observe that to is 
in fact a monotone increasing function of 7, and the max- 
imal advantage is only accessible for maximal entangle- 
ment. Furthermore, Alice should deviate from the strat- 
egy D if and only if the degree of entanglement exceeds 
a certain threshold value 7th = arcsin(l/V5) ~ 0.464. 
The observed threshold behavior is in fact reminiscent of 
a first order phase transition in Alice's optimal strategy: 
at the threshold she should discontinuously change her 
strategy from D to Q. 

(a) (b) 
Pa m 




FIG. 4. Quantum versus classical strategies: (a) Alice's 
pay-off as a function of 6 when Bob plays U {6, 0) (U (0, 0) — C 
and If (71,0) = D) and Alice chooses C (solid line), D (dots) 
or M (dashes), (b) The expected pay-off Alice can always 
attain in an unfair game as a function of the entanglement 
parameter 7. 

Summarizing we have demonstrated that novel fea- 
tures emerge if classical games like the Prisoners' 
Dilemma are extended into the quantum domain. We 
have introduced a correspondence principle which guar- 
antees that the performance of a classical game and its 
quantum extension can be compared in an unbiased man- 
ner. Very much like in quantum cryptography and com- 
putation, we have found superior performance of the 
quantum strategies if entanglement is present. 
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